TRANSLATION INVARIANT EQUATIONS AND THE METHOD 

OF SANDERS 

THOMAS F. BLOOM 

Abstract. We extend the recent improvement of Roth's theorem on three term 
arithmetic progressions by Sanders to obtain similar results for the problem of locat- 
ing non-trivial solutions to translation invariant linear equations in many variables 
CN in both Z/NZ and F^[t]. 
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J5 1. Introduction 

Q This paper concerns solutions to a linear equation in s > 3 variables, 

^ (1.1) CiXi + --- + C,X, = Q^O, 

Lh working inside some fixed ring i?, and we restrict our attention to those equations 

^ with coefficients satisfying ci + - ■ ■ + Cs = 0. These are often referred to as translation 

invariant systems, since it follows that if (xi, . . . ,Xs) G -R'^ is a solution then so is 
{xi + X, . . . ,Xs + x) for any x E R. Following Ruzsa [ ] we define the genus of an 
rH equation of the shape (1-1) to be the largest number m with the property that there 

is a partition of {1, ... , s} into m disjoint nonempty sets Tj where X^jg^^. Cj = 0. Note 
that the genus is well defined and positive, as a consequence of translation invariance. 
J> Given an equation (1.1) of genus m and a finite set A one obtains \A\"^ solutions 

O X e A'^ by setting Xi = Xii whenever i,i' G Tj. We call such solutions trivial, and seek 

,__! an upper bound on the size of sets which contain only trivial solutions to (1.1). In this 

^— I paper we generalise a recent result of Sanders which gives the best known bound in the 

t — case with s = 3 and R = Z/iVZ to obtain quantitative results of comparable quality 

^ for any s > 3 in circumstances where R is either Z/A^Z or the polynomial ring ¥g[t]. 

,__! Despite the analogies between these rings having been well-explored throughout most 

K^ of number theory, the problems of additive combinatorics have been little studied in 

the ¥g[t] setting, and we hope that this paper will encourage others to obtain further 
results of this nature in polynomial rings. We also wish to promote the philosophy 
C^ that ¥q[t] is a useful model case for such problems, acting as a 'halfway house' between 

the currently popular finite field model case F^ and Z/A^Z; technically simpler than 
the latter but capturing more of the interesting behaviour of the integer case than 
the former. 

The problem of finding large sets with no non-trivial solutions to (1.1) may be 
posed in any ring (or indeed any module), but since the work of Roth (see [14, 15]) it 
has received most attention in the integers Z, or rather finite truncations {1, . . . , A^} 
that may conveniently be viewed as the cyclic group Z/A^Z. In the special case 






(N 



X 



Date: January 31, 2012. 

The author is supported by an EPSRC doctoral training grant. 

1 



2 THOMAS F. BLOOM 

s = 3 and (ci, C2, C3) = (1, —2, 1) this is equivalent to finding a three term arithmetic 
progression a,a + d,a + 2d, with non-trivial solutions satisfying d ^ Q. If s > 3 and 
c G {Tj/NTjY is such that (1.1) has genus m>l then we define r{N) = Vs^dN) to be 
the cardinality of the largest subset of Z/A^Z which contains no non-trivial solutions 
to (1.1). Roth showed that r{N) <^s_c A^/loglogiV. This was improved for s = 3 by 
Heath-Brown [ ] and Szemeredi [20] to r{N) <^ N/log^ N for some absolute constant 
c > 0, the value of which was subsequently improved by Bourgain (see [2, 3]), first 
to c = 1/2 — 0(1) and then to c = 2/3 — o(l). Sanders further improved this to 
c = 3/4 — 0(1) in [iV] before the recent breakthrough result of [18], 

(1.) ^('v)«-^^^i:^- 

All of the results mentioned above were obtained only for the special case of three 
term progressions when c = (1,-2,1), but it is straightforward to generalise their 
methods for any translation invariant equation of the form (1.1) with s = 3. 

We expect that as the number of variables s increases non-trivial solutions to (1.1) 
should become easier to find, and hence we should obtain improved bounds for r{N). 
The first result of this paper confirms this expectation, generalising the result of 
Sanders to handle arbitrary s > 3. 

Theorem 1.1. Let s > 3, and suppose c ^ V is such that the equation (1.1) has 
genus m >1. Then 

(log log A^)' 



x5N «-2 



We remark that the implicit constant in Theorem 1.1 in fact depends only on 
i = maxi<j<s|cj|. In [16] Ruzsa showed that if the equation (1.1) has genus m then 
r{N) <^ A^^/™, which is far superior to Theorem 1.1 whenever m > 2. The power of 
the methods used to obtain the bounds listed above, which originated in Roth [11], 
is that they give a non-trivial result in the most difficult case m = 1. When s > 6, 
recent work of Schoen and Shkredov [ ], building on other work of Sanders, shows 
that for some absolute positive constants C and c we have the near-optimal bound 
r{N) <C A^exp(— Clog*^ A^). Theorem 1.1, however, gives the sharpest known bounds 
for s = 4 and s = 5. 

By partial summation we obtain the following corollary. 

Corollary 1.1. If A G 'N satisfies X^ae^*^"^ ~ ^^ then A contains infinitely many 
non-trivial solutions to every translation invariant equation of the form (1.1) having 
s > 4 variables. 

For comparison, a conjecture of Erdos asserts that if XlaeA ^^^ ~ '^ then A must 
contain infinitely many arithmetic progressions of length k for any A; G N. In the case 
A; = 3, this would follow were Corollary 1.1 to hold with s = 3, which would in turn 
follow if one could improve the (log log A^)^ factor in Theorem 1.1 to (loglog A^)~^, 
for example. 

Another popular setting for the problem of finding solutions to the equation (1.1) is 
F^, an A^-dimensional vector space over the finite field ¥p. Here we have coefficients 
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Ci G ¥p and we define rs,c{N) to be the cardinality of tlie largest subset of F^ that 
contains only trivial solutions to the equation (1.1). The best known bounds here 
are superior to those in Z/iVZ. Thus, for example, the simple method of Meshulam 
[13] yields r{N) -C p^/N, a conclusion comparable to r{N) <^ N/logN in the 
integer case. Furthermore, Bateman and Katz [ ] have recently improved this upper 
bound to show that for some absolute constant e > 0, one has r{N) -C p'^ /N^~^'^. 
The investigation of such problems in F^ has recently become popular since it is 
often conceptually easier to prove results in F^, taking advantage of the vector space 
structure, and then 'translate' the methods to the more difficult case Z/A^Z (see [~] 
for a comprehensive discussion of this technique). 

One setting that has received comparatively little attention is Fq[t], the ring of 
polynomials over a finite field; as in the case of Z it is more convenient to deal with 
finite truncations of this infinite ring, so we shall work in the additive subgroup of 
polynomials with degree strictly less than A^, which we shall denote by Gn. There are 
many well-known analogies between number theory in Z and in ¥q[t], and theorems 
in one often have a natural counterpart in the other. We should expect, in particular, 
some result analogous to (1.2) to hold for solutions to the equation (1.1) where c G 
Fg[t]'^. Perhaps part of the reason why this problem has been largely overlooked is that 
Gn is additively isomorphic to F^, since both are A^-dimensional vector spaces over 
Fg. Hence in the classical case, with s = 3 and c = (1, —2, 1), the strong bounds of the 
F^ setting are available. In general, such bounds are available whenever c G F^, where 
p is the characteristic of ¥g. In the Fg[t] setting, however, this restricts all coefficients 
to have degree zero, and hence trivial size under the usual valuation \a\ = q'^'^^"-. 
Viewed in this way it is less surprising that we can obtain such impressive bounds 
with such a strong restriction on the coefficients of our linear equation. Note in 
particular that such issues of triviality do not arise in the Z/A^Z case, since s > 3 
and translation invariance forces at least one of the coefficients of the equation (1.1) 
to have non-trivial size. 

When s > 3 and c G Fq[t]'', denote by r{N) the size of the largest subset of Gn 
which contains no non-trivial solutions to the equation (1.1). The sharpest bound for 
r{N) currently available in the case where c G F^ is due to Liu and Spencer [l l] who 
showed that r{N) <^ q^ /N'^~'^. The second result of this paper adapts the method of 
Sanders to give a comparable bound even when the coefficients q have large degree. 
As above, this result gives a bound which improves as s increases. 

Theorem 1.2. Let s > 3, and suppose c G Fq[t]* is such that the equation (1.1) has 
genus m> 1. Then 

r„(7V)«„,"A>og/Vr- 



where i = max(degCj). 

Note here that the implicit constant depends only on the highest degree of the 
coefficients q. It also depends on q, the size of the constant field. Throughout this 
paper, this will be considered fixed as will s and c, and all constants may depend on 
these. 
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It is not difficult to get some kind of quantitative bound for r{N) even when the 
coefficients have large degree, and indeed the original method of Roth can be adapted 
with a little effort to obtain r{N) <^ g^/ log A^. Theorem 1.2 goes beyond this to show 
that we can obtain a quantitative bound comparable to the best known in Z/iVZ. 
Indeed, in ¥g[t] we are able to do slightly better (by a factor of logA^) than the 
analogue of (1.2). This improvement is a consequence of the vector space structure 
of Fg[t] which we are able to exploit even when the coefficients have large degree. 

We remark that the methods of [J (i] are easily adapted to the Fg[t] setting to give 
r{N) -C g^/™- whenever (1.1) has genus m, a conclusion sharper than that of Theorem 
1.2 whenever m > 2. The strength of Theorem 1.2 is that it applies when m = 1, 
which appears to be the most difficult case. 

Since this paper was originally written, we have been made aware of recent similar 
results by Liu and Zhao [12]. By adapting the methods of [ ] and [ ] to the Fg[t] 
setting they show, under the same hypotheses as Theorem 1.2, that 



,(A^) <^s,i q 



N I i^ogNf 



N 



This is weaker than Theorem 1.2 for all s > 3, as it does not use the new techniques 
introduced in [IS]. We note, however, that the mechanisms they use to translate ideas 
from the Z/iVZ setting of [2] and [3] to Fg[t] are very similar to those introduced in 
this paper. In particular, they also give the proper analogue of Bohr sets in the Fg[t] 
setting. 

The main ideas and techniques used to establish Theorems 1.1 and 1.2 are those of 
the original argument in [' ""], but we are able to make many technical simplifications in 
Fq[t], largely thanks to the fact the analogue of 'Bohr sets' are closed under addition. 
Indeed, we hope that this paper may also serve as an exposition of the methods in 
[18], for in ¥q[t] the fundamental ideas are less obscured by technical difficulties. We 
further hope that this paper will indicate how to translate many results of additive 
combinatorics to an analogous result in Fg[t], which we believe to be a technically 
simpler model for such results. 

In recent years there has been a focus on obtaining such results in F^ and then 
translating the methods used so as to obtain a result in Z/A^Z. We believe that the 
translation from F^ to ¥g[t] and thence to Z/A^Z is simpler to perform and more 
intuitive. We are confident that this new method will stimulate further progress in 
other problems of additive combinatorics. 

Thus, for example, the approach taken in this paper is to ffist establish Theorem 
1.2, the Fq[t] case, and then sketch how to adapt the proof to obtain Theorem 1.1, the 
Z/A^Z case. In Section 2 we explain the notation used in the rest of the paper and 
prove two useful lemmas. Sections 3 and 4 provide the two tools used in the approach 
of Sanders [Is], namely a combinatorial transformation of sets inspired by the work 
of Katz and Koester [9] and an application of the useful probabilistic theorem due 
to Croot and Sisask [H]. Section 5 combines these tools in the 'density increment' 
strategy so as to conclude the proof of Theorem 1.2. Section 6 then sketches a proof 
of Theorem 1.1. Here we are brief, since the proofs are the same as those for Theorem 
1.2 except for routine technical changes. 
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2. Notation and preliminaries 

Throughout the rest of this paper, let N be some fixed large integer and ¥q be 
the finite field with q elements. We shall write G = G^ = {x G ¥q[t] : degx < N}, 
and note in particular that G is an A^-dimensional vector space over F^. We will 
frequently use the expectation notation 

x£X I I xeX 

Unless denoted otherwise, all expectations are taken over G. By a convenient abuse 
of notation, we use the same letter for the set and its characteristic function. When 
considering C C G, we use fie to denote the measure induced by G, so that 

and for A G G, 

fic{A) = {iic.A)= ' . 

For notational convenience we will also use (5 to denote /i^- Given a subspace B G G 
we will denote subsets of B by capital Roman letters, and their density within B by 
the corresponding lower case Greek letter, so that P{A) = a, for example. We draw 
special attention to our novel notation for the balanced function of a subset A G B, 
defined by 

{I — a ii X E A, 
-a ii X e B\A, and 
otherwise, 

so that E,x(zbA{x) = 0. For any functions /, (^ : G — ?> C we define 



and similarly, for any 1 < p < oo. 



,^ ^x] 

x&X 



p(Mx) 

xeX 



]El/(x)r) ' andl|/lU(,,) = sup|/(x)|. 



xex 



Where the measure is omitted these expectations should be taken over G. There is 
an analogue of Fourier analysis in the Fq[t] setting. If a; = 'Ylii<,N^i^^ ^ ^g((l/^)) then 
we define 

e{x) := exp(27riTr(a_i)/p) 

where Tr : Fg — )■ ¥p is the familiar trace map. We may now define a character on G 
as the map x i— )■ e{C,x) where ^ is a member of T, the dual group of ¥q[t] which is 
defined by 



T= i^a,f :a, gF, 



i<0 
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The usual definitions and results of Fourier analysis apply in this case, so we define 
the Fourier transform of / by 

xeG 
and we have Parseval's identity 

where G is the dual group of G defined by 

G = {a^it-^ + ■■■ + a^Nt~^ : tti G Fg, -N < i < -1}. 

For a more comprehensive discussion and detailed proofs for Fourier analysis over 
Fg[t] we refer the reader to [10]. In most cases, when we use the convolution operator 
it will be with respect to the measure over some subspace B (which will be clear from 
the context), and hence we define 

f*g{x) ='^f{y)g{x-y). 

y&B 

In some cases, g will be a measure fic for some other set C, and there we define 

f*fic{x) = ]g/(a;-y). 

yec 

Note in particular that when convolving with a measure, the convolution itself is 
always global (i.e. over the entire group G). As usual, the Fourier transform converts 
convolution into multiplication, with an appropriate scaling factor: 

^g(5)7^(o = mm- 

This has the useful corollary, frequently used without mention in what follows, that 

For finite F C T and k : F — )■ N we define the Bohr set -Bre(F) to be the set 

{xeG: deg{a;^} < -m for all ^ G F} 

where if a; = Ylii<N'^i^^ ^ ^^g((lA)) then {x} = '^i^oCat''. We call the size of F the 
rank of B, denoted by Tk{B), and call k = max^gr ^(0 the width of B. We note the 
easy estimate 

Furthermore, note that B^{r) is itself a vector space over ¥g. If B = -Bk(F) is a given 
Bohr set then by B^ we denote the Bohr set BK+fc(r)- 

This is analogous to the traditional Bohr set in the context of Z/A^Z, 

x^ 



Bohrp(F) = <x: 



N 



< p for all ^ G F 



where || ■ || measures the distance to the nearest integer and F is a subset of Z/NZ. 
For both concepts, the idea is to find a set on which a given set of characters is 
'approximately' trivial. This is made easier in our setting of Fg[t], since a character 
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can only take finitely many values even as A^ — )■ oo, unlike the Z/A^Z case. The 
significant observation here is that the other important feature of Bohr sets is that 
they are approximately closed under dilation by the coefficients of our linear forms; 
in the ¥q[t] case this follows from the inclusion A ■ i?K+m(r) C Bi^{r) which holds 
whenever deg X < m. We also desire that they be, at least approximately, closed 
under addition. This has obscured the dilation preservation feature in the traditional 
settings of Z/A^Z and F^, since this follows from closure under addition using crude 
estimates such as 2 ■ A G A + A. 

In the density increment procedure used to prove Theorem 1.2 we will need to 
dilate a Bohr set by some c G ^q[t] and ensure that this dilate is itself a Bohr set, 
so that the next iteration can be performed. The following lemma shows that this 
is true, and that the rank of this new Bohr set can only increase by some bounded 
amount. Recall that Gm denotes the set of polynomials in ¥q[t] with degree strictly 
less than m. 

Lemma 2.1. If c E ¥q[t]\{0} and B = B^iT) is a Bohr set with width k such that 
B C GAT-dcgc then c ■ B is a Bohr set with rank at most rk{B) + q'^'^^'^ and width k. 

Proof. Choose c^^ G Fg(t) such that c^^c = 1 and let T' = c^^(T U Gdegc)- Let 
f = {{x} : X G r'}. We have f C G and |f[ < |r'| < |r| + g^''^^, so to complete 
the proof it remains to show that c ■ B = Bf^i{r), where k' defined by k,'{^) = ^(7) if 
C, = c^'^'~f for some 7 G F and n'{C,) = 1 otherwise. 

By the orthogonality relationship of the exponential function (see Lemma 1 in [ ]) 
we have the identity 

y e(c-^/x) = |° when cfx, and 

77^ I g™sc when c | x. 

By the definition of e(-), however, we also have X^iec e{c~^lx) = g'^'^^c jf g^^^j only 
if Tr(c^^/a;) = for all / G Gdegc? which is true if and only if deg{c~^lx} < —1 for all 
/ G Gdegc since Fg ■ Gdegc = Cdegc- It follows that c | x if and only if deg{c~^/x} < — 1 
for all I G Gdegc- If we have some y G -Bfc(r), therefore, then we must have y = ex for 

some X G GAr_degc- 

Furthermore, for all 7 G F we have deg{x7} = deg{yc^^^} < —k and hence 
-Bfc(F) G c ■ B. Finally, ii x E B then for any 7 G F we have deg{cxc~^7} = 
deg{x7} < —k and for any / G Gdcgc we have {cx{c^^l}} = {cxc^^l} = so certainly 
deg{cxc~^/} < —1, so that ex G -Bfe(F) and hence Bk(T) = c ■ B as required. D 

We will frequently use both the big-0 notation and the Vinogradov ^ symbol, 
where the implicit constant depends on at most s, i and q. We will also sometimes 
use a c or C to denote some positive constant, again depending on at most s, i and q, 
which may vary from line to line. All these constants may be made explicit, though 
we do not do so here. 

We shall now define the concepts of spectra and symmetry sets. We define the 
?7-spectrum at G of a function / to be 

A,if) = {^^G:\m\>v\\f\\i}, 
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As above, we suppose B is some subspace of G which shall be clear from context, and 
for any L,K (Z B we define the ?7-symmetry set to be 

Sym^(L, K) = {xeB:L* K{x) > r]}. 

Note that Sym (L, —K) = — Sym (— L, K). We will need some structural information 
on spectra due to Chang [I]. The following version is proved as Lemma 4.36 in [:'l]. 

Lemma 2.2. Let G be any finite additive group and D <Z G with density 5. If rj > 
then there is a subset A C A^(D) such that |A| ^ rj'"^ log{l/6) and 

A,(D)C{-1,0,1}I^I-A. 

We now use this to convert the fact that we have large Fourier coefficients over a 
large spectrum into a more useful density increment property. The idea goes back to 
the work of Heath-Brown [S] and Szemeredi [20] , and was developed into the following 
form by Sanders [ ]. 

Lemma 2.3. For any Bohr set B G G and rj > if A, D G B satisfy 

J2 |A(7)P>z/aVG(i?) 

then there is a Bohr set B' G B with 

rk{B') < rk{B) + 0{7]-'^\og{l/6)) and /3{B') > exp{-G7]'^\og{l/6)) 
such that \\A * P'\\oo > «(! + i^)- 

Proof. Let B G G he the dual group of B viewed as an additive group. It is easy to 
check that for any 7 G G there is some 7' G -B such that 7(0;) = 'y'{x) for all x G B. 
Let A' = B D Ari{D). Renormalising the Fourier transform, we see that A' is the 
?7-spectrum at B for D. Let A C A' be the set given by Lemma 2.2, and refine the 
Bohr set B by setting 

B' = {x G B : deg{a;7} < -1 for all 7 G A}. 

It follows that 

rk(5') < rk(fi) + I A| < rk(5) + 0(7/-^ log(l/5)), 

and hence P{B') > exp(— C?]^^ log(l/(5)). Furthermore, for all 7 G A^(D) we have 
P'{'~f) = K^^B'^i^'j) = 1- By Parseval's identity and the initial hypothesis 

||A*/3'||^ = 5^|A(7)n^'(7)P> J2 |A(7)P>^«VG(i?). 
-red 7ga^(d) 

Furthermore, note that B*(3'{y) = E,x^B'B{y — x) = B(y), and hence {B*(3', B*(3') = 
Hg^B). Similarly, 

{A * /?', S * P') = ^mg{B) E E ^(^ - ^) = "'"^(S)- 
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In particular, 

\\A*(5'\\l = \\A*(3'\\l + a^{B*(3',B*/3')-2a{A*P',B*/3') 
= \\A*f3'\\l-a^fiG{B). 

It follows that \\A * P'Wl > a^(l + u)iig{B). Combining Holder's inequality with the 
equality \\A * [3'\\i = aficiB) gives \\A * /3'||oo > «(1 + ^) as required. D 

We make one final remark on the values of the densities a of the various A G G 
we consider, where these A will be sets with no non-trivial solutions to the equation 
(1.1). Recall that we may assume a < r{N) <^ 1/logA^ by adapting the method of 
Roth. In particular, for sufficiently large A^, we will assume that a < 1/e^, so that 
log(l/a;) > 2. Making this assumption avoids some notational awkwardness. 

3. A COMBINATORIAL TRANSFORMATION 

We begin with a generalisation of one of the two main tools in the method of 
Sanders, a combinatorial transformation. This has its origins in the work of Katz 
and Koester [9], though a similar transformation known as the Dyson e-transform 
has been a useful tool in additive combinatorics since [ ]. 

The idea is to find, given subsets Ai,...,Ak+i of a Bohr set B, corresponding 
L, Si, . . . ,Sk C B such that L is dense inside B, the Si are not too sparse and 

L * Si* ■ ■ ■ * Sk < (x\^ Ai * ^2 * ■ ■ ■ * Afc+i- 

Since solutions to (1.1) are counted by ((ci ■ A) * • • ■ * {cs~\ • A), Cs ■ A) to find many 
solutions to (1.1) in A it then suffices to find a lower bound for (L * ■ ■ ■ * Ss-2-, Cg- A). 
The large density of L may then be efficiently exploited using the recent probabilistic 
method of Croot and Sisask, as we shall do in the next section. When we cannot find 
such sets we find a strong density increment and jump to the final stage of the proof. 
We first give a simplified proof of Lemma 4.2 from [18], taking advantage of the fact 
that our Bohr sets are subspaces in order to streamline the proof. We also observe 
that the proof gives a slightly stronger statement than is recorded in [18]. We then 
apply this technical lemma iteratively to construct sets L, 5*1, . . . , S'^ as above to prove 
the main theorem of this section. This is a generalisation of Proposition 4.1 from [18], 
taking advantage of the large number of convolutions to run a more efficient iterative 
procedure. 

Lemma 3.1. Ij B is a Bohr set and K,T,L, S G B then either 

(1) there exists a Bohr set B' G B with rank at most rk{B) + 0(Ak~^ log(l/r)), 
density f3{B') > exp(— CAk""*^ log(l/r)) such that, for some x G B, f3'{K + 
x) > k/?>2X, or 

(2) there are L', S' G B with /3{L') >\ + k/2 and /3{S') > Ta/2 such that, for all 
X G G, 

L' * S\x) <L* S{x) + K* T{x). 

Proof. For an arbitrary x G B, let L^ = L U {K + x) and S^ = 5" fl (T — a;). By 
construction, 

L^ * S^ < L * S^ + {K + x) * S^ < L * S + K *T. 
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Furthermore, we have 

P{L^) = X + K-/3{Lr]{K + x)) = X + K-L* {~K){x) 

and (3{Sx) = {—S) *T{x). Hence if |Sym^^/2(~'S', T)| > |Sym^/2(-^) ~-^)l then we are 

in the second case by taking L' = L^ and S' = S^ for some x G Sym^^^gl^'S', T)\Sym^/2(-^) ~^)- 

Note that 

/3{Sjm^^/,{-S,T))a + ar/2 > ^{S) * T{x) = ar, 

xeB 
so that /3(Sym^^/2(~'S', T)) > r/2. Hence either we are in the second case of the 
theorem, or we may assume that D = Sym^^gl"-^; -^) has density at least r/4, say. 
We have 

{{-L)*K,D)p>k6/2. 

Note that 

{{-L)*B,D)^ = X6. 

We may assume that A < 1/4, say, or else the conclusion is trivial. Hence, by the 
triangle inequality, 

\{{-L)*K,D)fs\ > k6{1/2-X) >k6/4. 

Taking the Fourier transform of the left hand side we see that 

The Cauchy-Schwarz inequality combined with Parseval's identity then gives 

^|K(7)r|5(7)r > «:V/.G(i?)Vl6A. 

Furthermore, if we let rf = k,/32X then by Parseval's identity again we have 

J2 |K(7)n5(7)r<«:^<5VG(5)V32A. 

Using the trivial upper bound |-D(7)| < djiG^B) it follows that 

Y, |K(7)P>«:Vg(5)/32A. 

7eA^{D) 

Applying Lemma 2.3, we have a Bohr set B' d B with rank and size as required 

and (5'{K + x) > k/?,2X for some x eX. U 

We now apply Lemma 3.1 iteratively to prove the main theorem of this section. 

Theorem 3.1. Let B he a Bohr set and suppose Ai d B with density ai and 
A2, . ■ ■ , Ak+i C B all with density at least a. Then either 

(1) there exists a Bohr set B' C B with rank rk{B') < rk{B) + 0{ai log(l/a)), 
density (3 [B') > exp(— C«]^ log(l/a)) and, for some x E B, we have (3'{Ai + 
x) > 2a\, or 
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(2) there are sets L G B and Si, . . . ,Sk C B with A > 2^*^"^ and cji > a;*""i for 
1 < i < k such that 

L* Si* ■ • • * Sk{x) < a{^Ai * ^2 * ■ ■ ■ * Afc+i(x) 

for all X E G. 

Proof. Let Lq = Ai. We show by induction on 1 < j < A; that we may either find 

Xi, . . . ,Xj d B such that Lj = Ai+ Xi + ■ ■ ■ + Xj satisfies 

Xj > max(ai|Xi| ■ ■ ■ \Xj\/2^ ,a~^^'' /2^) 

and Si, . . . , Sj C B satisfying o"j > a*"°i for 1 < i < j such that 

Lj * Si* ■ ■ ■ * Sj * Aj^2 * ■ ■ ■ * Ak+i{x) <a\ Ai* A2* ■ ■ ■ * A^j^iix) 

for all X G G, or there is a Bohr set B' C B such that rk(5') < Tk(B)+0{ai log(l/a)) 
and for some x & B we have /3'{Lj_i + x) > 2ai\Xi\ ■ ■ ■ \Xj_i\. If we are in the second 
case for any 1 < j < k then we halt the inductive procedure and by the pigeonhole 

principle we must have, for some x, G Xj, f3'{Ai +Xi-\ \-Xj^i + x) > 2ai, giving us 

the first case of the theorem. Otherwise, setting j = k and L = L^ gives the second 
case of the theorem. 

We now fix 1 < j < fc and suppose that we have produced Lj_i and Si, ... , Sj-i 
as above. We produce a sequence of sets Lj, Sj C B iteratively such that Xj-ii > 
Af > Xj-ii/2, af > {a/2y, and if * sf < iLj^i * Aj+i. We begin the iteration 

by letting Lj := Lj_i and Sj := Aj+i. 

We now repeatedly apply Lemma 3.1, with K = Lj_i and T = Aj+i, and note that 

(i+i) _ j^ii 



an examination of the proof shows that we take L^ = L^ U {Lj_i + x^+i) for some 



Xj+i G -B so that L^ = Lj^i + X where |X | < i. If we can continue the process 
for i' = \a^ 1 steps, then we can halt and set Lj = L^ , so that Lj = Lj_i + Xj 



f =Vi+xf where |xf 

where 

A,- > A,-_i rar'^1/2 > max(ai|Xi| ■ ■ ■ |Xj|/2^ al-'^''/2^) 

by inductive hypothesis. Furthermore, if we set Sj = S* then aj > (a;/2)* > a'^"^ 
and by inductive hypothesis again, 

Lj* ■■■ * Sj* Aj^2 * ■■■ * Ak+i < [a^ 1-^j-i * ■ ■ ■ * Sj^i * Aj^i * ■■■ * A^+i 

< a^ ^' Ai* A2* ■ ■ ■ * Ak+i 

as required. 

Otherwise, we must halt at some i' < a^ with the first alternative from Lemma 
3.1, so that there is a Bohr set B' G B with rank at most 

rk(5) + 0(Af Ut_\ log(l/a)) < rk(5) + 0(a^'/' log(l/a)) 
and density l3{B') > exp(— Ca^^ log(l/a;)) such that, for some x E B, 

(3'{Lj_i +x)> A,_i/32Af ^ > ailXil ■ ■ ■ \Xj_i\/V+''Xf^ > 2ai\Xi\ ■ ■ ■ |X,-_i| 



12 THOMAS F. BLOOM 

since if A^* > 2^^^^ > 2^'^^^ we may end the entire procedure with the second case 
of the theorem. This completes the induction. D 

4. The Croot-Sisask probabilistic method 

We now use the new Croot-Sisask method of random samphng from ["] to find 
a density increment that can exploit the fact that one set in a convolution is very 
dense, while allowing us to ignore the density loss in the other sets that results from 
applying Theorem 3.1. The operator Tt is defined by Tt{f){x) = f{x — t). The theorem 
finds a large set T such that the translation operator Tt is essentially constant on the 
convolution L* Si* ■ ■ ■* Sk for all t E T. This set may have no structure initially, but 
Croot and Sisask observed that we may recover some structure by taking repeated 
sumsets of T. In [ ] this was exploited to great effect as in the proof below. 

We first state the result from [ ] that we require, before proving an analogue of 
Corollary 5.2 of [ ] for multiple convolutions. The proof given here is very similar, 
but once again we are able to present a simplified and streamlined version using the 
subspace structure of our Bohr sets. 

Theorem 4.1. Let B he an additive group, S G B and / : S — )■ C any function. 
Then for any e > and p >2 there is some T G B with /3{T) > a^'^ ^ such that 

WnU *l^s)- f* /i5||p{/3) < e||/||p(/3) 

for all t G T. 

This is a global version of Theorem 5.1 from [18], which we are able to apply here 
because Bohr sets in ¥q[t] are additive groups. When they are only approximately 
closed under addition, as in Z/A^Z, the local theorem of [IS] must be used. The 
following theorem applies this to obtain a useful density increment. 

Theorem 4.2. Let B be a Bohr set, and suppose A, L, Si, . . . ,Sk G B. Then either 

(1) {L* Si*---* Sk, A)is > AcTi ■ ■ ■ o-fca/2, or 

(2) for any integer I there exists a Bohr set B' G B such that 

rk{B') < rk{B) + 0{X-^-^/'a-^/HHog{l/a)\og{l/ak)), 

and, for some x G B, we have f3'{A + a;) > a{l + cX). 

Proof. Applying Theorem 4.1 with parameters p and e to be chosen later, where 
f = L * jj^Si * ■ ■ ■ * I^Sk-i^ there is a set T G B such that \T\ > o"^, ^ ^\B\ and 

\\n{L * fisi * ■ ■ ■ * fiSk) - L * fis^ * ■ ■ ■ * fJ'SkWpW) < 4L * fJ'Si * ■ ■ ■ * fJ'Sk- 

< e||L||p(^) 

for all t G T, or after rescaling, 

\\Tt{L * Si* ---* Sk) - L* Si* ---* Sk\\p(f}) <eai--- (TkX^^^ < eai ■ ■ ■ dk- 

By the triangle inequality, for any / G N and t G IT, 

\\Tt{L * Si * - - - * Sk) — L * Si * - - - * Sk\\p(i3) < ^ecTi ■ ■ -crk- 
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li we let g := fix * ■■■ * fJ'T with / copies of /it, then after recalhng the definition of Tt 
and applying the triangle inequality again we get 

\\L * Si * ■ ■ ■ * Sk * g - L * Si * ■ ■ ■ * Sk\\p{i3) < lecTi ■ ■ -ak- 

By Holder's inequality, 

\{L* Si* ■■■ * Sk* g,A)^- {L* Si* ■ ■■ * Sk,A)^\ < leai ■ ■ ■ ak\\A\\p/p^m3) 

< Ao"i • • • a^a/A 

if we let p = [log(l/a)] and e = X/Ael. Hence either we are in the first case, or 

{L * Si * ■ ■ ■ * Sk* g, A)j3 < SAcTi • ■ ■ crfc«/4. 

Since ^xesg = 1 we have 

'^L* Si* ■ ■■ * Sk* g{x) = AcTi ■ ■ ■ ak- 



xeB 



Hence 



\{L * Si * ■ ■ ■ * Sk* g,A)i3\ > AfTi ■ ■ ■ crfca/4. 

We complete the proof by a standard conversion into Fourier space, as in the proof 
of Lemma 3.1. Taking the Fourier transform of this inequality we see that 



J2mSiii)---Skii)g{i)A{l) 

■yed 



>Xai---akai2G{Bf+'/A. 



Using the Cauchy-Schwarz inequality combined with Parseval's identity and the trivial 
bound 15^(7)1 < aificiB) gives 

^|^(7)A(7)|'>AaVG(5)/16. 

Recalling the definition of g this is 

5^|/I?(7)|''|A(7)|'>AaVG(i^)/16. 
7eG 

Furthermore, if we let ?7^' = Aa/32 then, by Parseval's identity, 

J2 1/^(7)1'' |A(7)f<AaVG(i?)/32. 

7^A„(T) 

Combining these inequalities we have 

J2 |A(7)f> E 1/^(7)1'' |A(7)f>AaVG(i?)/32 
7GA^(r) 76A^(r) 

and the second case follows from Lemma 2.3. D 
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5. Density increment strategy 

We now prove Theorem 1.2 using the traditional density increment strategy: we 
construct a series of Bohr sets B^^^ such that at each stage, either A fl 5'-*^ has many 
solutions to (1.1), or we may find a new Bohr set i?(*+^) c -B^*^ on which the density 
of A increases. Since the density is always trivially bounded above by 1, we can only 
iterate this a bounded number of times before we must end with the first alternative, 
from which we deduce the existence of many solutions to (1.1) in the original set 
A, and hence deduce that A must contain at least one non-trivial solution if a is 
sufficiently large. 

We define A{A) by letting A(A)|G|*~^ be the number of solutions (xi, . . . ,Xs) to 
(1.1) with Xi G A. Note in particular that we include the trivial solutions so that 
A{A) > \A\'^\G\^~^, where m is the genus of (1.1). 

We first combine the results from the previous two sections to obtain a density 
increment which we then iterate to provide a lower bound for A{A), from which we 
deduce Theorem 1.2 as a corollary. 

Lemma 5.1. Let B be a Bohr set and Ai G B for 1 < i < s, all with density at least 
a. Then either 

(1) {A,*A,■■■*A,.^,A,)p:^a^--'^''~'\ or 

(2) we have a Bohr set B' C B of rank rk(B) + 0(a;^"'^/''*^^-*(log(l/a;))^) and size 

such that for some x E B and i G {1, s} we have P'{Ai + x) > a(l + c). 

Proof. By Theorem 3.1, either we are in the second case of the theorem or there are 
L,Si,..., Ss-2 C B with A > 2-'-^ and a^ > a^""'^^'""' for 1 < i < s - 2 such that 

Ai * A2- ■ ■ * As_i > a^L * ^i * ■ ■ • * 5's_2- 

We may now apply Theorem 4.2 to get that either for any integer / we have a Bohr 
set B' G B such that 

rk{B') < Tk{B) + 0{a-^/^'-^^-^^HHog^{l/a)) 

and, for some x E B, we have (3'{As + x) > a{l + c), or 

{L*Si*---* Ss^2, As)p > ai ■ ■ ■ a,_2Aa/2 > a^^"''''"\ 

Combining this with the bound above, we are either in the first or second case of the 
theorem, after taking a roughly optimal choice of / = [log(l/«)'|. D 

We now prove the following generalisation of Theorem 1.2 using the traditional 
density increment strategy. 

Theorem 5.1. Let s > 3, and suppose that c G ^q[tY satisfies the condition that 
(1.1) has genus m > 1. Then there exists a constant C > 0, depending only on s, q 
and i = max(degCj), such that if A C G then 

A{A) > exp(-Ca-^/(^-2)(log(l/a))^). 
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Proof. We shall iteratively construct a sequence of Bohr sets B^^^ and sets A^^^ C G, 
each a subset of some translated and dilated copy of A. Let ik{B^^^) = S-'^'> and 
^(«)(^yl(*)) = Q,(«)_ We shall insist that, given i?^*^ and A^''\ we can either find a i?(*+^) 
and A^^^^'^ such that for some constant c' > (depending only on s, d and g), the 
density satisfies a'^*"'"-^) > «(*)(! + c'), 

and 

or we have 



A(A) > exp(-Crf«)/iG(5^* 



)^«-i«^" 



-l/(s-2) 



We begin the iteration by letting E^^^ = Gat-^^ so that d^'^^ < q"'^ and A^^) = A'flE^^) 
where A' is some translate of A chosen by the pigeonhole principle so that a^^^ > a. 
This is permissible since the conclusion of our theorem is invariant under translation. 
Since we always have the trivial bound a*-*^ < 1, we can iterate this process only 
0(log(l/a)) times, hence at some B^^\ with K <^ log(l/a), we have the second 
alternative. Here we have 

i=l i=0 

Similarly, 

from which the result follows. 

We now discuss the iterative procedure. We are given a Bohr set B^^^ and a set 
^(«)_ To reduce notation in what follows, let A = A^'^^ fl B^^^ and B = B'^^K We have 
A C B with density 5 = a^^\ where i? is a Bohr set with rank d and width k. Note 
that for any B' G B we have E^g^A * P'{x) = K^^b'A * /3{x) = a. In particular, for 
1 ^ i ^ s define Bj = Ci- ■ ■ Cj_iCj+i ■ ■ ■ Cg ■ B^i. Since Bj C B for all 1 < j < s, we 
have 

]g(I*/3i + --- + I*/3,)(x) = s5 

xeB 
and in particular there is some x E B such that 

A* Pi{x) H h A*/3s(x) > s5. 

Note that if 

for any 1 < J < s then we may proceed to the next stage of the iteration, letting 
5(«+i) = ]^. and yi(«+i) = yl(«) _ a;. Here we use Lemma 2.1 which ensures Bj is 
a genuine Bohr set, after noting that since we chose -B*-^^ = G^^se the condition 
Bse C GAr_dcg{ci--c _ic +i---c4 will always be satisfied. Otherwise we must have A * 
fij{x) > 5 (1 - f) for all 1 < j < s. 
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Hence if we let B' = ci ■ Bi = ■■■ = Cs ■ B^, and similarly Aj = Cj{A^'''^ — x), then 
we have (3'{Aj) > 5 (l — |) for 1 < j < s. By Lemma 2.1 we see that B' is a genuine 
Bohr set and furthermore B' C B. We now satisfy the conditions of Lemma 5.1; 
hence either 

or there is a Bohr set B" C B' of rank d + 0(a^"'^/^*~^''(log(l/a))^) and density 

(3iB") = (3{B')(3\B") > exp{-Cid + a-'/^'-'\\ogil/a)Y)) 

such that either Ai or As has density increment at least 5(1 + c)(l — |) > 5(1 + |). 
In this case we are done, letting i?(*+i) = B" and A^^^^^ be Ai or A^ as appropriate. 
Otherwise, note that 

A{A) > fiGiB'y-'{A, *A2---* A-i, As)p^. 

Here is where we make essential use of the translation and dilation invariance condi- 
tion ^ Cj = 0, for the inner product on the right hand side only counts solutions in 
a set which may be translated and dilated many times from our original A, but each 
solution can be retranslated and dilated to give a solution in the original set. Putting 
this all together and noting the estimate (3{B') > exp{—Cd) we have 

A{A) > /iG(5')^-'a^""'^''"'' > exp{-Cd)fiG{By~'a^''~"^^'''^ 

as required. D 

We finally prove Theorem 1.2 as a corollary. 

of Theorem 1.2. Suppose A G G with density a has no non-trivial solutions to (1.1), 
so that A{A) = a'»g{i+"»-^)^. Theorem 5.1 then implies N < a-^/^'-^\\og{l/a)y. 
If a <^ A^~*+2 then we are done; otherwise, we have log(l/Q;) ^ log A^, and hence 



^^ , (logiV) 



4\ s-2 



N 
as required. D 

6. The integer case 

We now sketch how to prove Theorem 1.1 using the ideas from previous sections. 
The results here will be much closer in spirit to [ i '-^] , and we focus on explaining the 
process of translating a proof in ¥q[t] to one in 7j/N7j. 

Given p > and F C Z/A^Z we define the Bohr set -Bp(r) to be 

Bp(T) = {x e Z/NZ : \e{-fx) - 1| < p for all 7 G F} 

where we are now using the character given by e{x) = exp{27Tix/N). As in the Fg[t] 
case, we define rk(i?) = |F| and call p the width of 5p(F). We have the analogous 
size estimate |i?p(F)| > pI'^'A^. Further discussion and proofs can be found in Chapter 
4 of [21]. 

The reason for the extra technical complications in the Z/NZ case is that it is no 
longer true that B + B = B ioi a Bohr set B. In general, the best we can do is 
Bp{r) + Bp(r) C i?2p(F), which may be exponentially larger. The key insight here. 
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originating with Bourgain [ ], is that if we restrict ourselves to instead consider B + B' 
for some smaller Bohr set B' G B, and if B is sufficiently well-behaved, then we know 
that B + B' is mostly contained within B, and this is enough to run the previous sort 
of arguments while incurring only a small error term. 

To make this precise we call a Bohr set B = Bp{r), where Tk{B) = k, regular if 

^^||^<|B„Wr)l<(i + ioo«=W)|fl| 

whenever |?7| < 1/lOOA;. Given any Bohr set B we can always find a regular Bohr set 
that closely approximates it. The following is given as Lemma 4.25 in [21]. 

Lemma 6.1. Given a Bohr set Bp{r) there is e E [1/2, 1) such that B^p{r) is regular. 

Any statement which relies on the subspace structure of Bohr sets B in the previous 
arguments can then be replaced with an approximate version, incurring an error 
term dependent on the width parameter p, which can be controlled by choosing p 
sufficiently small. In many cases this means p must depend on a, which results in 
the extra logarithmic factors in Theorem 1.1. For example, the appeal to the identity 
{A, B * f3') = apciB) in the proof of Lemma 2.3 can be replaced by an appeal to the 
following lemma. 

Lemma 6.2. If B = Bp{r) is a regular Bohr set with rank k and B' C B^p(V) for 
some e <C 1/k then for any / : G — )■ C with \\f\\oo < 1 we have 

(/, 5 */?')=( ]g fix) + 0{ek)) pg{B). 
V xeB / 

Proof. Since supp(-B * f3') = B -\- B' C B(^i^^)p(T), we have 

\G\{f,B*P')= Yl f{x)B*P\x). 

2;GB(i+,)p(r) 

Furthermore, whenever x G -B(i„e)p(r)) we have B' + x G B and hence B * /3'(x) = 1. 
It follows that the sum above is 

E/(^)+ E f{x){B*P'{x)-l)+ Y. f{x)B*P\x). 
xeB xei3\B(i_,)p(r) xeB(i+,)p(r)\i? 

By regularity, however, in each of the second sums there are 0{ek)\B\ summands, 
and the result follows by the triangle inequality. D 

We restrict ourselves to stating the main tools needed for the proof of Theorem 
1.1, which closely follows the proof of Theorem 1.2. Making the required technical 
changes is a simple but lengthy matter, and is easily done by comparing the proofs 
given in the rest of this paper to the proofs for the s = 3 and Z/A^Z case given in 
[18], which works in this more complicated setting from the beginning. 

The following are the analogues of Theorems 3.1 and 4.2 (and also Proposition 4.1 
and Corollary 5.2 from [ i -]) respectively. Note that the only change is that we restrict 
certain sets to be inside a sub-Bohr set of the main Bohr set, with width suitably 
restricted. 
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Theorem 6.1. Let Bp(r) and B' = Bpi{r') be regular rank d Bohr sets such that 
B' C B^p{V). Suppose further that Ai <Z B and A2, . . . , Afc+i C B' all with density at 
least a. Finally, suppose e < cai/d and e' < ca/d for some sufficiently small absolute 
constant c > 0. Then either 

(1) there exists a regular Bohr set B" C B' such that 

rk{B") < rk{B) + 0{a~^^'' \og{l/ a)), 



P'{B") > ' ""' 



C(d+a//'''log(l/a)) 



^(ilog(l/a)_ 

and, for some x E B, we have (5"{Ai + x) > 2ai, or 

(2) there are sets L C B and Si, . . . , Sk C B" with A ^ 1 and at > a*^"! for 
1 < i < k such that 

L * Si* ■ ■ ■ * Sk{x) < a{^Ai * y42 * ■ ■ ■ * y4fc+i(a;) 

for all X (z G, where all convolutions are taken over the measure /3' . 

Theorem 6.2. Let Bp{r) and B' be regular rank d Bohr sets such that B' C B^piV) 
and suppose A,L G B and Si, . . . ,Sk C B' . Suppose further that e < cXa/d for some 
sufficiently small absolute constant c > 0. Then either 

(1) {L* Si* ■■■* Sk, A)p > AcTi • • • aka, or 

(2) for any integer I there exists a regular Bohr set B" C B' and an m satisfying 

m < A-2-^/''a-^/2'/' log(l/a) log(l/(T,.) 

with rank rk{B") < rk(B) +m, density (3'{B") > (1/dm)'^^^^"^^ and, for some 
X e B, we have P"{A + x) > a(l + cA). 

The following is the main density increment lemma needed to prove Theorem 1.1, 
and is proved by combining Theorems 6.1 and 6.2 as in the proof of the analogous 
Lemma 5.1. 

Lemma 6.3. Let Bp{r) and B' = Bp'(r') such that B' C B^p(T). Suppose further 
that Ai,As C B and A2, . . . , As_i C B' all with density at least a. Finally, suppose 
that e < c/dinm{ai,as) and e' < ca/d for some sufficiently small absolute constant 
c > 0. Then either 

(1) {Ai*A2---* As-i, As)p > a^"i"'''''"''a,, or 

(2) we have a Bohr set B" C B' such that 

rk{B") < rk{B) + 0{a^^^'^''^^ log^(l/««) log(l/a)), 

^ C(d+a-l/(^-2) log3{l/«.) log(l/a)) 

P'{B") > • ' 



^d\og{l/a) 

such that for some x G B and i G {1, s}, l3"{Ai + x) > a(l + c) for some 
absolute constant c > 0. 

Theorem 1.1 is then proved by repeated applications of Lemma 6.3, following the 
strategy of the proof of Theorem 5.1. 
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